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Multiplicative Weights 

In this lecture we will introduce Multiplicative Weights, a simple technique with many applications. We 
start with an example. 

Example Suppose Mr. X wants to bet on football games but docs not know much about football himself. 
Before each game, X can check the predictions of n experts. Is there an algorithm that allows Mr. X to 
perform well in the long run? 
Two potential ideas are: 

(1) For each game, bet according to what the majority of experts predict 

(2) Wait a few games to sec which of the experts get it right most of the time and then follow their advice 

These strategics work well in some cases but not in others: (1) fails when only a few experts make good 
predictions, and (2) fails when there is an expert that performs well for the first few games and then never 
makes a correct prediction again. Instead, we will consider a combination of the two approaches: for each 
game, we will consider the opinion of all experts, but each expert's opinion will be weighted according to 
his past performance. More precisely, let w\ denote the weight of expert i after t games, and consider the 
following algorithm: 

1. Set tv® = 1 for i = 1, n 

2. Make a prediction for game t based on a weighted majority of experts where expert i gets weight 

3. After game t update the weights as follows: if expert i's prediction for game t was wrong then set 
w\ = (1 — e)w' _1 ; otherwise set w\ — u>- _1 

For this algorithm, we have the following: 

Theorem Let to* denote the number of mistakes that expert i makes in the first t games and to* denote 
the number of mistakes that Mr. X makes in the first t games. Then for all i and t, 

m t < 2 ^ + 2(l + e)^ 
and in particular, this holds for the i that minimizes to*. 

Proof Define <& fc = J2i w i- If M r - X makes a mistake at game k, then a weighted majority of the experts 
must have made a wrong prediction for game k. The weights of all these experts drop by a factor of (1 — e) 
and so we have <& < (1 — e/2)<j> fc_1 . Then over the first t games we have 

$* < (1 _ £)"V = ^ _ £)m< 

On the other hand we have w\ = (1 — e) m * and so 
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$* > w\ = 




Therefore, 




Rearranging this inequality gives 



to* < 



log(n) 



'^(l -e/2) 



-io 5 (l - e/2) 



This bound is slightly stronger than the one in the statement of the theorem. Using the inequalities 
e/2 < —log{\ — e/2) and e + e 2 > —log{l — e) converts it to the required form and completes the proof. 

Next, we will modify our algorithm to get rid of the factor of 2 on the right hand side of the bound above. 
Consider the following: 

1. Set = 1 for i = 1, n 

2. To make a prediction for game t, do the following: for i = l,...n, follow expert i's prediction with 
probability p\ = w'r 1 / £\ w'f 1 

3. After game t update the weights as follows: if expert i's prediction for game t was wrong then set 
w\ = (1 - e)w\~ 1 else set wj — w' -1 

For this algorithm, we have the following: 

Theorem Let to* denote the number of mistakes that expert i makes in the first t games and let m* denote 
the random variable equal to the number of mistakes that Mr. X makes in the first t games. Then for e < 1/2 
and for all i and t, 



and in particular, this holds for the i that minimizes to*. 

The proof of this Theorem is similar to before and we will omit it. Instead, we will introduce our most 
general version of the multiplicative weights algorithm. In the example above, we had only two possibilities 
for the relation between event outcomes and expert predictions: the outcome of game t either matched expert 
i's prediction or it did not. Our measure of performance for individual experts and for the algorithm as a 
whole was simply counting wrong predictions. We want to generalize the algorithm to allow for an arbitrary 
set P of possible outcomes to events. In this setting, we will measure the performance of the algorithm as 
follows: we will say that at each step, following expert i's prediction when the true outcome is j incurs a 
penalty of M(i,j). More precisely, we have the following: 

0. The input of the algorithm consists of: a set P of possible outcomes to events. For i = 1, ...n and for 
jePa number M(i,j) from the interval [—1, p]. We will refer to p as the width; we will also have the 
restriction I < p. 

1. Set = 1 for i = 1, n 

2. To make a prediction for event t, do the following: for i = l,...n, follow expert i's prediction with 
probability p\ = w]^ 1 / J2j w ) 1 



log(n) 



+ (l + e)TO* 



e 
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3. Let j* denote the outcome of event t. Update the weights as follows: 

t = , wl~\l - e ) M ^yp ifM(i,j*) > 
Wl ~ 1 wl-\l + e)- M ^ t yp ifM(i,j*) < 

A similar analysis to before gives: 

Theorem Let D t denote the probability distribution {p\, . . . ,p f n } with which we pick experts to make a 
prediction for event t. Let M{D t denote the expected value of our penalty when following the distribution 
D l for event t and when the actual outcome is j*. Then for e < 1/2 and for all T and i, 

£ M( ^)<^M + (l + e ) £ M(i,f) + (l-e) £ M(i,f) 

t=l ' t:M(i,j*)>0 t:M(i,j*)<0 

Corollary For any S, for e < mm(l/2, 5/Ap), for T = 16p 2 log(n) /5 2 rounds and for all i, the average 
penalty we get per round obeys: 

T T 

and in particular our average penalty per round is at most 5 bigger than the average penalty of the best 
expert. 

Applications of Multiplicative Weights 

Our first application of the Multiplicative Weights algorithm will be to zero-sum games. In a zero-sum game, 
we have a row player, R, and a column player, C. If R plays strategy i and C plays strategy j, then R pays 
C the amount M(i,j). Players can also play mixed strategies, i.e. probability distributions over the sets of 
pure strategics. We will extend our payoff notation so that M(D,P) denotes the expected amount that R 
pays C when R plays the mixed strategy D and C plays the mixed strategy R. Recall that von Neumann's 
Minimax Theorem states that 

miriD maxj M(D,j) = maxp miriiM(i, P) 
We will denote the above quantity by A; it is known as the value of the game. 

We are now ready to state the zero-sum game problem: given the sets of strategies for R and C and the 
payoffs M(i,j), estimate the value of the game A. Our approach will be to associate elements of the current 
problem to appropriately chosen elements of the Multiplicative Weights algorithm, then directly apply what 
we already know about Multiplicative Weights to conclude that we do indeed get a good approximation to 
A in a reasonable amount of time. The details of the argument will be presented next lecture. 
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